Deformable Part-based Fully Convolutional Network for Object Detection

نویسندگان

  • Taylor Mordan
  • Nicolas Thome
  • Matthieu Cord
  • Gilles Henaff
چکیده

Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular. In this paper we introduce DP-FCN, a deep model for object detection which explicitly adapts to shapes of objects with deformable parts. Without additional annotations, it learns to focus on discriminative elements and to align them, and simultaneously brings more invariance for classification and geometric information to refine localization. DP-FCN is composed of three main modules: a Fully Convolutional Network to efficiently maintain spatial resolution, a deformable part-based RoI pooling layer to optimize positions of parts and build invariance, and a deformation-aware localization module explicitly exploiting displacements of parts to improve accuracy of bounding box regression. We experimentally validate our model and show significant gains. DP-FCN achieves state-of-the-art performances of 83.1% and 80.9% on PASCAL VOC 2007 and 2012 with VOC data only.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deformable ConvNet with Aspect Ratio Constrained NMS for Object Detection in Remote Sensing Imagery

Convolutional neural networks (CNNs) have demonstrated their ability object detection of very high resolution remote sensing images. However, CNNs have obvious limitations for modeling geometric variations in remote sensing targets. In this paper, we introduced a CNN structure, namely deformable ConvNet, to address geometric modeling in object recognition. By adding offsets to the convolution l...

متن کامل

Convolutional Gating Network for Object Tracking

Object tracking through multiple cameras is a popular research topic in security and surveillance systems especially when human objects are the target. However, occlusion is one of the challenging problems for the tracking process. This paper proposes a multiple-camera-based cooperative tracking method to overcome the occlusion problem.  The paper presents a new model for combining convolutiona...

متن کامل

Region-based Deformable Fully Convolutional Networks for Multi-class Object Detection at Signalized Traffic Intersections

Multi-class object detection is critical for intelligent traffic monitoring applications in smart cities as well as connected autonomous vehicles. Although, numerous research works evaluate the performance of image processing algorithms for onvehicle cameras, the body of research evaluating performance of image processing of stationary cameras located near intersections is limited. In this rese...

متن کامل

A Two-Dimensional Convolutional Neural Network for Brain Tumor Detection From MRI

Aims: Cancerous brain tumors are among the most dangerous diseases that lower the quality of life of people for many years. Their detection in the early stages paves the way for the proper treatment. The present study aimed to present a two-dimensional Convolutional Neural Network (CNN) for detecting brain tumors under Magnetic Resonance Imaging (MRI) using the deep learning method. Methods & ...

متن کامل

Object Detection with Mask-based Feature Encoding

Region-based Convolutional Neural Networks (R-CNNs) have achieved great success in the field of object detection. The existing R-CNNs usually divide a Region-of-Interest (ROI) into grids, and then localize objects by utilizing the spatial information reflected by the relative position of each grid in the ROI. In this paper, we propose a novel featureencoding approach, where spatial information ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1707.06175  شماره 

صفحات  -

تاریخ انتشار 2017